Annotation Guidelines for Questions under Discussion and Information Structure
Abstract
We present a pragmatic, i.e. meaning-based, method for the information-structural analysis of corpus data, which is built on the idea that for any assertion contained in a text (or transcript of spoken discourse) there is an implicit Question under Discussion (QUD) that determines which parts of the assertion are focused or backgrounded (and which ones are not-at-issue, i.e. not part of the assertion at all). We formulate a number of constraints which allow the analyst/annotator to derive QUDs from the previous or upcoming discourse context and demonstrate the method using corpus examples (of French, German and English). Since we avoid making reference to language-specific morphosyntactic or prosodic properties, we claim that our method is also cross-linguistically applicable beyond our example languages.
Similar resources
QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation
We discuss and evaluate a new annotation scheme and discourse-analytic method, the QUD-tree framework. We present an annotation study, in which the framework, based on the concept of Questions under Discussion, is applied to English and German interview data, using TreeAnno, an annotation tool specially developed for this new kind of discourse annotation. The results of an inter-annotator agree...
Content of Linguistic Annotation: Standards and Practices (CLASP) Research Activities and Findings
25 members of the computational linguistics research community participated in a meeting at New York University on November 7, 2009 to address several difficult questions about the standardization of linguistic content in corpus annotation, where we define the term standardization to include all efforts to improve compatibility or interoperability between annotation content, including not only ...
Multimedia Annotation: Comparability of Gloss Modalities and their Implications for Reading Comprehension
This study compared the effects of two annotation modalities on the reading comprehension of Iranian intermediate level EFL learners. The two experimental groups under study received treatment on 10 academic L2 reading passages under one of two conditions: One group received treatment on key words in the reading passages through a multimedia environment providing textual annotations. The second...
Phenotyping, association analysis and annotation of genes related to leaf wilting of bread wheat (Triticum aestivum L.) at the seedling stage under drought stress conditions
Rapid screening of plant germplasm in the early stages of growth and determining the genetic basis of wheat leaf wilting index at the seedling stage is necessary for wheat breeding programs. In the present research, leaf wilting index for 290 Iranian bread wheat genotypes, including 90 cultivars and 200 landraces, was studied under drought stress conditions at the seedling stage in 2021 in res...
Focus Annotation in Reading Comprehension Data
When characterizing the information structure of sentences, the so-called focus identifies the part of a sentence addressing the current question under discussion in the discourse. While this notion is precisely defined in formal semantics and potentially very useful in theoretical and practical terms, it has turned out to be difficult to reliably annotate focus in corpus data. We present a new...